Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 128367 |
| Missing cells | 1123851 |
| Missing cells (%) | 30.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 122.5 MiB |
| Average record size in memory | 1000.4 B |
Variable types
| Categorical | 22 |
|---|---|
| Numeric | 7 |
CRASH DATE has a high cardinality: 430 distinct values | High cardinality |
CRASH TIME has a high cardinality: 1440 distinct values | High cardinality |
LOCATION has a high cardinality: 52867 distinct values | High cardinality |
ON STREET NAME has a high cardinality: 4776 distinct values | High cardinality |
CROSS STREET NAME has a high cardinality: 5270 distinct values | High cardinality |
OFF STREET NAME has a high cardinality: 28958 distinct values | High cardinality |
CONTRIBUTING FACTOR VEHICLE 1 has a high cardinality: 55 distinct values | High cardinality |
VEHICLE TYPE CODE 1 has a high cardinality: 414 distinct values | High cardinality |
VEHICLE TYPE CODE 2 has a high cardinality: 427 distinct values | High cardinality |
VEHICLE TYPE CODE 3 has a high cardinality: 77 distinct values | High cardinality |
LATITUDE is highly correlated with LONGITUDE | High correlation |
LONGITUDE is highly correlated with LATITUDE | High correlation |
NUMBER OF PERSONS INJURED is highly correlated with NUMBER OF MOTORIST INJURED | High correlation |
NUMBER OF MOTORIST INJURED is highly correlated with NUMBER OF PERSONS INJURED | High correlation |
NUMBER OF MOTORIST KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
NUMBER OF PERSONS KILLED is highly correlated with NUMBER OF MOTORIST KILLED | High correlation |
BOROUGH has 44591 (34.7%) missing values | Missing |
ZIP CODE has 44600 (34.7%) missing values | Missing |
LATITUDE has 10096 (7.9%) missing values | Missing |
LONGITUDE has 10096 (7.9%) missing values | Missing |
LOCATION has 10096 (7.9%) missing values | Missing |
ON STREET NAME has 33769 (26.3%) missing values | Missing |
CROSS STREET NAME has 68115 (53.1%) missing values | Missing |
OFF STREET NAME has 94598 (73.7%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 2 has 28507 (22.2%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 3 has 116070 (90.4%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 4 has 125009 (97.4%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 5 has 127377 (99.2%) missing values | Missing |
VEHICLE TYPE CODE 2 has 39742 (31.0%) missing values | Missing |
VEHICLE TYPE CODE 3 has 116772 (91.0%) missing values | Missing |
VEHICLE TYPE CODE 4 has 125156 (97.5%) missing values | Missing |
VEHICLE TYPE CODE 5 has 127407 (99.3%) missing values | Missing |
LATITUDE is highly skewed (γ1 = -26.76189702) | Skewed |
LONGITUDE is highly skewed (γ1 = 26.84483109) | Skewed |
OFF STREET NAME is uniformly distributed | Uniform |
COLLISION_ID has unique values | Unique |
NUMBER OF PERSONS INJURED has 90759 (70.7%) zeros | Zeros |
NUMBER OF PEDESTRIANS INJURED has 120992 (94.3%) zeros | Zeros |
NUMBER OF MOTORIST INJURED has 103780 (80.8%) zeros | Zeros |
Reproduction
| Analysis started | 2021-03-16 03:34:57.922404 |
|---|---|
| Analysis finished | 2021-03-16 03:35:53.489375 |
| Duration | 55.57 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 430 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.2 MiB |
| 01/18/2020 | 774 |
|---|---|
| 03/06/2020 | 673 |
| 02/14/2020 | 632 |
| 02/07/2020 | 604 |
| 02/27/2020 | 581 |
| Other values (425) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1283670 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 01/02/2020 |
|---|---|
| 2nd row | 01/02/2020 |
| 3rd row | 01/02/2020 |
| 4th row | 01/02/2020 |
| 5th row | 01/02/2020 |
| Value | Count | Frequency (%) |
| 01/18/2020 | 774 | 0.6% |
| 03/06/2020 | 673 | 0.5% |
| 02/14/2020 | 632 | 0.5% |
| 02/07/2020 | 604 | 0.5% |
| 02/27/2020 | 581 | 0.5% |
| 02/10/2020 | 572 | 0.4% |
| 01/17/2020 | 566 | 0.4% |
| 03/02/2020 | 562 | 0.4% |
| 02/03/2020 | 559 | 0.4% |
| 01/21/2020 | 556 | 0.4% |
| Other values (420) | 122288 |
| Value | Count | Frequency (%) |
| 01/18/2020 | 774 | 0.6% |
| 03/06/2020 | 673 | 0.5% |
| 02/14/2020 | 632 | 0.5% |
| 02/07/2020 | 604 | 0.5% |
| 02/27/2020 | 581 | 0.5% |
| 02/10/2020 | 572 | 0.4% |
| 01/17/2020 | 566 | 0.4% |
| 03/02/2020 | 562 | 0.4% |
| 02/03/2020 | 559 | 0.4% |
| 01/21/2020 | 556 | 0.4% |
| Other values (420) | 122288 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 403631 | |
| 2 | 338480 | |
| / | 256734 | |
| 1 | 130923 | 10.2% |
| 3 | 31595 | 2.5% |
| 8 | 22285 | 1.7% |
| 7 | 21701 | 1.7% |
| 9 | 21656 | 1.7% |
| 6 | 20649 | 1.6% |
| 5 | 18632 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1026936 | |
| Other Punctuation | 256734 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 403631 | |
| 2 | 338480 | |
| 1 | 130923 | 12.7% |
| 3 | 31595 | 3.1% |
| 8 | 22285 | 2.2% |
| 7 | 21701 | 2.1% |
| 9 | 21656 | 2.1% |
| 6 | 20649 | 2.0% |
| 5 | 18632 | 1.8% |
| 4 | 17384 | 1.7% |
| Value | Count | Frequency (%) |
| / | 256734 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1283670 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 403631 | |
| 2 | 338480 | |
| / | 256734 | |
| 1 | 130923 | 10.2% |
| 3 | 31595 | 2.5% |
| 8 | 22285 | 1.7% |
| 7 | 21701 | 1.7% |
| 9 | 21656 | 1.7% |
| 6 | 20649 | 1.6% |
| 5 | 18632 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1283670 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 403631 | |
| 2 | 338480 | |
| / | 256734 | |
| 1 | 130923 | 10.2% |
| 3 | 31595 | 2.5% |
| 8 | 22285 | 1.7% |
| 7 | 21701 | 1.7% |
| 9 | 21656 | 1.7% |
| 6 | 20649 | 1.6% |
| 5 | 18632 | 1.5% |
| Distinct | 1440 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 0:00 | 2110 |
|---|---|
| 15:00 | 1572 |
| 16:00 | 1521 |
| 14:00 | 1453 |
| 17:00 | 1445 |
| Other values (1435) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.732259849 |
| Min length | 4 |
Characters and Unicode
| Total characters | 607466 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0:00 |
|---|---|
| 2nd row | 12:57 |
| 3rd row | 15:00 |
| 4th row | 15:10 |
| 5th row | 17:30 |
| Value | Count | Frequency (%) |
| 0:00 | 2110 | 1.6% |
| 15:00 | 1572 | 1.2% |
| 16:00 | 1521 | 1.2% |
| 14:00 | 1453 | 1.1% |
| 17:00 | 1445 | 1.1% |
| 13:00 | 1418 | 1.1% |
| 18:00 | 1360 | 1.1% |
| 12:00 | 1359 | 1.1% |
| 10:00 | 1251 | 1.0% |
| 9:00 | 1182 | 0.9% |
| Other values (1430) | 113696 |
| Value | Count | Frequency (%) |
| 0:00 | 2110 | 1.6% |
| 15:00 | 1572 | 1.2% |
| 16:00 | 1521 | 1.2% |
| 14:00 | 1453 | 1.1% |
| 17:00 | 1445 | 1.1% |
| 13:00 | 1418 | 1.1% |
| 18:00 | 1360 | 1.1% |
| 12:00 | 1359 | 1.1% |
| 10:00 | 1251 | 1.0% |
| 9:00 | 1182 | 0.9% |
| Other values (1430) | 113696 |
Most occurring characters
| Value | Count | Frequency (%) |
| : | 128367 | |
| 0 | 114253 | |
| 1 | 111181 | |
| 2 | 53789 | |
| 5 | 53150 | |
| 3 | 42929 | 7.1% |
| 4 | 34124 | 5.6% |
| 8 | 19275 | 3.2% |
| 7 | 17325 | 2.9% |
| 9 | 16636 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 479099 | |
| Other Punctuation | 128367 | 21.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 114253 | |
| 1 | 111181 | |
| 2 | 53789 | |
| 5 | 53150 | |
| 3 | 42929 | 9.0% |
| 4 | 34124 | 7.1% |
| 8 | 19275 | 4.0% |
| 7 | 17325 | 3.6% |
| 9 | 16636 | 3.5% |
| 6 | 16437 | 3.4% |
| Value | Count | Frequency (%) |
| : | 128367 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 607466 |
Most frequent character per script
| Value | Count | Frequency (%) |
| : | 128367 | |
| 0 | 114253 | |
| 1 | 111181 | |
| 2 | 53789 | |
| 5 | 53150 | |
| 3 | 42929 | 7.1% |
| 4 | 34124 | 5.6% |
| 8 | 19275 | 3.2% |
| 7 | 17325 | 2.9% |
| 9 | 16636 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 607466 |
Most frequent character per block
| Value | Count | Frequency (%) |
| : | 128367 | |
| 0 | 114253 | |
| 1 | 111181 | |
| 2 | 53789 | |
| 5 | 53150 | |
| 3 | 42929 | 7.1% |
| 4 | 34124 | 5.6% |
| 8 | 19275 | 3.2% |
| 7 | 17325 | 2.9% |
| 9 | 16636 | 2.7% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 44591 |
| Missing (%) | 34.7% |
| Memory size | 6.5 MiB |
| BROOKLYN | |
|---|---|
| QUEENS | |
| BRONX | |
| MANHATTAN | |
| STATEN ISLAND | 2798 |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.171886937 |
| Min length | 5 |
Characters and Unicode
| Total characters | 600832 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BRONX |
| 3rd row | MANHATTAN |
| 4th row | BROOKLYN |
| 5th row | BROOKLYN |
| Value | Count | Frequency (%) |
| BROOKLYN | 29091 | |
| QUEENS | 23503 | |
| BRONX | 16186 | 12.6% |
| MANHATTAN | 12198 | 9.5% |
| STATEN ISLAND | 2798 | 2.2% |
| (Missing) | 44591 |
| Value | Count | Frequency (%) |
| brooklyn | 29091 | |
| queens | 23503 | |
| bronx | 16186 | |
| manhattan | 12198 | |
| island | 2798 | 3.2% |
| staten | 2798 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 98772 | |
| O | 74368 | |
| E | 49804 | 8.3% |
| B | 45277 | 7.5% |
| R | 45277 | 7.5% |
| A | 42190 | 7.0% |
| L | 31889 | 5.3% |
| T | 29992 | 5.0% |
| S | 29099 | 4.8% |
| K | 29091 | 4.8% |
| Other values (9) | 125073 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 598034 | |
| Space Separator | 2798 | 0.5% |
Most frequent character per category
| Value | Count | Frequency (%) |
| N | 98772 | |
| O | 74368 | |
| E | 49804 | |
| B | 45277 | 7.6% |
| R | 45277 | 7.6% |
| A | 42190 | 7.1% |
| L | 31889 | 5.3% |
| T | 29992 | 5.0% |
| S | 29099 | 4.9% |
| K | 29091 | 4.9% |
| Other values (8) | 122275 |
| Value | Count | Frequency (%) |
| 2798 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 598034 | |
| Common | 2798 | 0.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| N | 98772 | |
| O | 74368 | |
| E | 49804 | |
| B | 45277 | 7.6% |
| R | 45277 | 7.6% |
| A | 42190 | 7.1% |
| L | 31889 | 5.3% |
| T | 29992 | 5.0% |
| S | 29099 | 4.9% |
| K | 29091 | 4.9% |
| Other values (8) | 122275 |
| Value | Count | Frequency (%) |
| 2798 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 600832 |
Most frequent character per block
| Value | Count | Frequency (%) |
| N | 98772 | |
| O | 74368 | |
| E | 49804 | 8.3% |
| B | 45277 | 7.5% |
| R | 45277 | 7.5% |
| A | 42190 | 7.0% |
| L | 31889 | 5.3% |
| T | 29992 | 5.0% |
| S | 29099 | 4.8% |
| K | 29091 | 4.8% |
| Other values (9) | 125073 |
| Distinct | 204 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 44600 |
| Missing (%) | 34.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10912.96565 |
|---|---|
| Minimum | 10000 |
| Maximum | 11697 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1003.0 KiB |
Quantile statistics
| Minimum | 10000 |
|---|---|
| 5-th percentile | 10016 |
| Q1 | 10458 |
| median | 11210 |
| Q3 | 11354 |
| 95-th percentile | 11432 |
| Maximum | 11697 |
| Range | 1697 |
| Interquartile range (IQR) | 896 |
Descriptive statistics
| Standard deviation | 513.1528987 |
|---|---|
| Coefficient of variation (CV) | 0.04702231409 |
| Kurtosis | -1.21036805 |
| Mean | 10912.96565 |
| Median Absolute Deviation (MAD) | 202 |
| Skewness | -0.6206983569 |
| Sum | 914146394 |
| Variance | 263325.8975 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 11207 | 2038 | 1.6% |
| 11236 | 1596 | 1.2% |
| 11212 | 1518 | 1.2% |
| 11208 | 1415 | 1.1% |
| 11203 | 1355 | 1.1% |
| 11385 | 1317 | 1.0% |
| 11234 | 1238 | 1.0% |
| 11434 | 1201 | 0.9% |
| 11226 | 1171 | 0.9% |
| 11233 | 1097 | 0.9% |
| Other values (194) | 69821 | |
| (Missing) | 44600 |
| Value | Count | Frequency (%) |
| 10000 | 19 | < 0.1% |
| 10001 | 488 | |
| 10002 | 699 | |
| 10003 | 361 | |
| 10004 | 65 | 0.1% |
| 10005 | 44 | < 0.1% |
| 10006 | 52 | < 0.1% |
| 10007 | 150 | 0.1% |
| 10009 | 271 | 0.2% |
| 10010 | 302 |
| Value | Count | Frequency (%) |
| 11697 | 11 | < 0.1% |
| 11695 | 1 | < 0.1% |
| 11694 | 132 | 0.1% |
| 11693 | 124 | 0.1% |
| 11692 | 143 | 0.1% |
| 11691 | 613 | |
| 11436 | 256 | 0.2% |
| 11435 | 669 | |
| 11434 | 1201 | |
| 11433 | 512 |
| Distinct | 38213 |
|---|---|
| Distinct (%) | 32.3% |
| Missing | 10096 |
| Missing (%) | 7.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.67035821 |
|---|---|
| Minimum | 0 |
| Maximum | 40.912884 |
| Zeros | 163 |
| Zeros (%) | 0.1% |
| Memory size | 1003.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.59918 |
| Q1 | 40.66645 |
| median | 40.71532 |
| Q3 | 40.791084 |
| 95-th percentile | 40.86544 |
| Maximum | 40.912884 |
| Range | 40.912884 |
| Interquartile range (IQR) | 0.124634 |
Descriptive statistics
| Standard deviation | 1.513132419 |
|---|---|
| Coefficient of variation (CV) | 0.03720479695 |
| Kurtosis | 716.3483792 |
| Mean | 40.67035821 |
| Median Absolute Deviation (MAD) | 0.055432 |
| Skewness | -26.76189702 |
| Sum | 4810123.935 |
| Variance | 2.289569717 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 163 | 0.1% |
| 40.861862 | 130 | 0.1% |
| 40.820305 | 70 | 0.1% |
| 40.675735 | 65 | 0.1% |
| 40.651863 | 64 | < 0.1% |
| 40.65965 | 63 | < 0.1% |
| 40.69168 | 62 | < 0.1% |
| 40.696033 | 60 | < 0.1% |
| 40.8047 | 60 | < 0.1% |
| 40.83801 | 58 | < 0.1% |
| Other values (38203) | 117476 | |
| (Missing) | 10096 | 7.9% |
| Value | Count | Frequency (%) |
| 0 | 163 | |
| 40.504116 | 1 | < 0.1% |
| 40.50447 | 1 | < 0.1% |
| 40.504482 | 1 | < 0.1% |
| 40.50465 | 1 | < 0.1% |
| 40.505062 | 1 | < 0.1% |
| 40.50526 | 1 | < 0.1% |
| 40.506187 | 2 | < 0.1% |
| 40.50667 | 1 | < 0.1% |
| 40.506756 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 40.912884 | 1 | |
| 40.912468 | 1 | |
| 40.91222 | 1 | |
| 40.91217 | 1 | |
| 40.912018 | 1 | |
| 40.911667 | 1 | |
| 40.911068 | 1 | |
| 40.9109 | 1 | |
| 40.91076 | 1 | |
| 40.91038 | 1 |
| Distinct | 29126 |
|---|---|
| Distinct (%) | 24.6% |
| Missing | 10096 |
| Missing (%) | 7.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.81002768 |
|---|---|
| Minimum | -74.253006 |
| Maximum | 0 |
| Zeros | 163 |
| Zeros (%) | 0.1% |
| Memory size | 1003.0 KiB |
Quantile statistics
| Minimum | -74.253006 |
|---|---|
| 5-th percentile | -74.018845 |
| Q1 | -73.95836 |
| median | -73.91696 |
| Q3 | -73.86384 |
| 95-th percentile | -73.76084 |
| Maximum | 0 |
| Range | 74.253006 |
| Interquartile range (IQR) | 0.09452 |
Descriptive statistics
| Standard deviation | 2.743264834 |
|---|---|
| Coefficient of variation (CV) | -0.03716656015 |
| Kurtosis | 719.3131649 |
| Mean | -73.81002768 |
| Median Absolute Deviation (MAD) | 0.046434 |
| Skewness | 26.84483109 |
| Sum | -8729585.784 |
| Variance | 7.525501947 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 163 | 0.1% |
| -73.91282 | 134 | 0.1% |
| -73.89083 | 80 | 0.1% |
| -73.89063 | 76 | 0.1% |
| -73.86536 | 73 | 0.1% |
| -73.89686 | 72 | 0.1% |
| -73.91243 | 70 | 0.1% |
| -73.98453 | 66 | 0.1% |
| -73.773834 | 62 | < 0.1% |
| -73.93755 | 61 | < 0.1% |
| Other values (29116) | 117414 | |
| (Missing) | 10096 | 7.9% |
| Value | Count | Frequency (%) |
| -74.253006 | 1 | < 0.1% |
| -74.25076 | 1 | < 0.1% |
| -74.25047 | 1 | < 0.1% |
| -74.25015 | 1 | < 0.1% |
| -74.24976 | 2 | |
| -74.24949 | 1 | < 0.1% |
| -74.24941 | 1 | < 0.1% |
| -74.248886 | 1 | < 0.1% |
| -74.24857 | 3 | |
| -74.24828 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 163 | |
| -73.700584 | 1 | < 0.1% |
| -73.70099 | 2 | < 0.1% |
| -73.70129 | 1 | < 0.1% |
| -73.7015 | 1 | < 0.1% |
| -73.70174 | 3 | < 0.1% |
| -73.70177 | 1 | < 0.1% |
| -73.70191 | 1 | < 0.1% |
| -73.70192 | 3 | < 0.1% |
| -73.70194 | 1 | < 0.1% |
| Distinct | 52867 |
|---|---|
| Distinct (%) | 44.7% |
| Missing | 10096 |
| Missing (%) | 7.9% |
| Memory size | 9.2 MiB |
| (0.0, 0.0) | 163 |
|---|---|
| (40.861862, -73.91282) | 129 |
| (40.820305, -73.89083) | 70 |
| (40.675735, -73.89686) | 65 |
| (40.65965, -73.773834) | 62 |
| Other values (52862) |
Length
| Max length | 23 |
|---|---|
| Median length | 22 |
| Mean length | 21.71183976 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2567881 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 6 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 33635 ? |
|---|---|
| Unique (%) | 28.4% |
Sample
| 1st row | (40.668266, -73.84214) |
|---|---|
| 2nd row | (40.700527, -73.94161) |
| 3rd row | (40.843033, -73.881805) |
| 4th row | (40.75974, -73.97423) |
| 5th row | (40.74955, -74.00654) |
| Value | Count | Frequency (%) |
| (0.0, 0.0) | 163 | 0.1% |
| (40.861862, -73.91282) | 129 | 0.1% |
| (40.820305, -73.89083) | 70 | 0.1% |
| (40.675735, -73.89686) | 65 | 0.1% |
| (40.65965, -73.773834) | 62 | < 0.1% |
| (40.696033, -73.98453) | 60 | < 0.1% |
| (40.8047, -73.91243) | 60 | < 0.1% |
| (40.651863, -73.86536) | 59 | < 0.1% |
| (40.83801, -73.87329) | 56 | < 0.1% |
| (40.668495, -73.925606) | 56 | < 0.1% |
| Other values (52857) | 117491 | |
| (Missing) | 10096 | 7.9% |
| Value | Count | Frequency (%) |
| 0.0 | 326 | 0.1% |
| 73.91282 | 134 | 0.1% |
| 40.861862 | 130 | 0.1% |
| 73.89083 | 80 | < 0.1% |
| 73.89063 | 76 | < 0.1% |
| 73.86536 | 73 | < 0.1% |
| 73.89686 | 72 | < 0.1% |
| 40.820305 | 70 | < 0.1% |
| 73.91243 | 70 | < 0.1% |
| 73.98453 | 66 | < 0.1% |
| Other values (67328) | 235445 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 275948 | |
| 4 | 244229 | 9.5% |
| . | 236542 | 9.2% |
| 3 | 215022 | 8.4% |
| 0 | 204443 | 8.0% |
| 8 | 163070 | 6.4% |
| 6 | 161196 | 6.3% |
| 9 | 155051 | 6.0% |
| 5 | 124712 | 4.9% |
| ( | 118271 | 4.6% |
| Other values (6) | 669397 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1740147 | |
| Other Punctuation | 354813 | 13.8% |
| Open Punctuation | 118271 | 4.6% |
| Space Separator | 118271 | 4.6% |
| Close Punctuation | 118271 | 4.6% |
| Dash Punctuation | 118108 | 4.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 7 | 275948 | |
| 4 | 244229 | |
| 3 | 215022 | |
| 0 | 204443 | |
| 8 | 163070 | |
| 6 | 161196 | |
| 9 | 155051 | |
| 5 | 124712 | |
| 2 | 99859 | 5.7% |
| 1 | 96617 | 5.6% |
| Value | Count | Frequency (%) |
| . | 236542 | |
| , | 118271 |
| Value | Count | Frequency (%) |
| ( | 118271 |
| Value | Count | Frequency (%) |
| 118271 |
| Value | Count | Frequency (%) |
| - | 118108 |
| Value | Count | Frequency (%) |
| ) | 118271 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2567881 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 7 | 275948 | |
| 4 | 244229 | 9.5% |
| . | 236542 | 9.2% |
| 3 | 215022 | 8.4% |
| 0 | 204443 | 8.0% |
| 8 | 163070 | 6.4% |
| 6 | 161196 | 6.3% |
| 9 | 155051 | 6.0% |
| 5 | 124712 | 4.9% |
| ( | 118271 | 4.6% |
| Other values (6) | 669397 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2567881 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 7 | 275948 | |
| 4 | 244229 | 9.5% |
| . | 236542 | 9.2% |
| 3 | 215022 | 8.4% |
| 0 | 204443 | 8.0% |
| 8 | 163070 | 6.4% |
| 6 | 161196 | 6.3% |
| 9 | 155051 | 6.0% |
| 5 | 124712 | 4.9% |
| ( | 118271 | 4.6% |
| Other values (6) | 669397 |
| Distinct | 4776 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 33769 |
| Missing (%) | 26.3% |
| Memory size | 9.1 MiB |
| BELT PARKWAY | 2211 |
|---|---|
| LONG ISLAND EXPRESSWAY | 1248 |
| BROOKLYN QUEENS EXPRESSWAY | 1204 |
| FDR DRIVE | 1202 |
| BROADWAY | 1022 |
| Other values (4771) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3027136 |
|---|---|
| Distinct characters | 70 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1579 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | CROSS ISLAND PARKWAY |
|---|---|
| 2nd row | W 57 & 8th Ave |
| 3rd row | CROSS BAY BOULEVARD |
| 4th row | NORTHERN BOULEVARD |
| 5th row | EAST 53 STREET |
| Value | Count | Frequency (%) |
| BELT PARKWAY | 2211 | 1.7% |
| LONG ISLAND EXPRESSWAY | 1248 | 1.0% |
| BROOKLYN QUEENS EXPRESSWAY | 1204 | 0.9% |
| FDR DRIVE | 1202 | 0.9% |
| BROADWAY | 1022 | 0.8% |
| MAJOR DEEGAN EXPRESSWAY | 1012 | 0.8% |
| GRAND CENTRAL PKWY | 996 | 0.8% |
| CROSS BRONX EXPY | 931 | 0.7% |
| ATLANTIC AVENUE | 910 | 0.7% |
| CROSS ISLAND PARKWAY | 880 | 0.7% |
| Other values (4766) | 82982 | |
| (Missing) | 33769 |
| Value | Count | Frequency (%) |
| avenue | 33517 | 15.2% |
| street | 25628 | 11.6% |
| east | 7620 | 3.5% |
| parkway | 6644 | 3.0% |
| boulevard | 6578 | 3.0% |
| expressway | 6254 | 2.8% |
| west | 4947 | 2.2% |
| road | 3570 | 1.6% |
| island | 2678 | 1.2% |
| cross | 2303 | 1.0% |
| Other values (2586) | 120719 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1778461 | ||
| E | 208548 | 6.9% |
| A | 118961 | 3.9% |
| R | 104027 | 3.4% |
| T | 94495 | 3.1% |
| N | 86853 | 2.9% |
| S | 82954 | 2.7% |
| U | 55287 | 1.8% |
| O | 53205 | 1.8% |
| V | 48789 | 1.6% |
| Other values (60) | 395556 | 13.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 1778461 | |
| Uppercase Letter | 1180929 | |
| Decimal Number | 60124 | 2.0% |
| Lowercase Letter | 6613 | 0.2% |
| Other Punctuation | 338 | < 0.1% |
| Open Punctuation | 334 | < 0.1% |
| Close Punctuation | 334 | < 0.1% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 208548 | |
| A | 118961 | |
| R | 104027 | 8.8% |
| T | 94495 | 8.0% |
| N | 86853 | 7.4% |
| S | 82954 | 7.0% |
| U | 55287 | 4.7% |
| O | 53205 | 4.5% |
| V | 48789 | 4.1% |
| L | 39006 | 3.3% |
| Other values (16) | 288804 |
| Value | Count | Frequency (%) |
| e | 1093 | |
| a | 652 | 9.9% |
| t | 627 | 9.5% |
| r | 599 | 9.1% |
| n | 448 | 6.8% |
| s | 419 | 6.3% |
| o | 341 | 5.2% |
| v | 263 | 4.0% |
| u | 262 | 4.0% |
| l | 248 | 3.8% |
| Other values (16) | 1661 |
| Value | Count | Frequency (%) |
| 1 | 15057 | |
| 2 | 6683 | |
| 3 | 6542 | |
| 4 | 5249 | 8.7% |
| 5 | 5088 | 8.5% |
| 6 | 4705 | 7.8% |
| 8 | 4696 | 7.8% |
| 7 | 4272 | 7.1% |
| 0 | 3925 | 6.5% |
| 9 | 3907 | 6.5% |
| Value | Count | Frequency (%) |
| . | 257 | |
| / | 66 | 19.5% |
| & | 12 | 3.6% |
| ' | 3 | 0.9% |
| Value | Count | Frequency (%) |
| 1778461 |
| Value | Count | Frequency (%) |
| - | 3 |
| Value | Count | Frequency (%) |
| ( | 334 |
| Value | Count | Frequency (%) |
| ) | 334 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1839594 | |
| Latin | 1187542 |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 208548 | |
| A | 118961 | |
| R | 104027 | 8.8% |
| T | 94495 | 8.0% |
| N | 86853 | 7.3% |
| S | 82954 | 7.0% |
| U | 55287 | 4.7% |
| O | 53205 | 4.5% |
| V | 48789 | 4.1% |
| L | 39006 | 3.3% |
| Other values (42) | 295417 |
| Value | Count | Frequency (%) |
| 1778461 | ||
| 1 | 15057 | 0.8% |
| 2 | 6683 | 0.4% |
| 3 | 6542 | 0.4% |
| 4 | 5249 | 0.3% |
| 5 | 5088 | 0.3% |
| 6 | 4705 | 0.3% |
| 8 | 4696 | 0.3% |
| 7 | 4272 | 0.2% |
| 0 | 3925 | 0.2% |
| Other values (8) | 4916 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3027136 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1778461 | ||
| E | 208548 | 6.9% |
| A | 118961 | 3.9% |
| R | 104027 | 3.4% |
| T | 94495 | 3.1% |
| N | 86853 | 2.9% |
| S | 82954 | 2.7% |
| U | 55287 | 1.8% |
| O | 53205 | 1.8% |
| V | 48789 | 1.6% |
| Other values (60) | 395556 | 13.1% |
| Distinct | 5270 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 68115 |
| Missing (%) | 53.1% |
| Memory size | 6.1 MiB |
| 3 AVENUE | 562 |
|---|---|
| BROADWAY | 523 |
| 2 AVENUE | 387 |
| LINDEN BOULEVARD | 346 |
| 5 AVENUE | 319 |
| Other values (5265) |
Length
| Max length | 32 |
|---|---|
| Median length | 13 |
| Mean length | 13.18376153 |
| Min length | 1 |
Characters and Unicode
| Total characters | 794348 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1748 ? |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | W 57 |
|---|---|
| 2nd row | SOUTH CONDUIT AVENUE |
| 3rd row | 68 STREET |
| 4th row | MADISON AVENUE |
| 5th row | NORTHERN BOULEVARD |
| Value | Count | Frequency (%) |
| 3 AVENUE | 562 | 0.4% |
| BROADWAY | 523 | 0.4% |
| 2 AVENUE | 387 | 0.3% |
| LINDEN BOULEVARD | 346 | 0.3% |
| 5 AVENUE | 319 | 0.2% |
| PARK AVENUE | 311 | 0.2% |
| 1 AVENUE | 290 | 0.2% |
| ATLANTIC AVENUE | 274 | 0.2% |
| BRUCKNER BOULEVARD | 268 | 0.2% |
| 7 AVENUE | 245 | 0.2% |
| Other values (5260) | 56727 | |
| (Missing) | 68115 |
| Value | Count | Frequency (%) |
| avenue | 26048 | 19.5% |
| street | 21239 | 15.9% |
| east | 5900 | 4.4% |
| boulevard | 3595 | 2.7% |
| road | 2682 | 2.0% |
| west | 2572 | 1.9% |
| place | 1610 | 1.2% |
| parkway | 1323 | 1.0% |
| expressway | 833 | 0.6% |
| park | 675 | 0.5% |
| Other values (2895) | 67078 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 137189 | |
| 73306 | 9.2% | |
| T | 67505 | 8.5% |
| A | 67113 | 8.4% |
| R | 54450 | 6.9% |
| N | 50480 | 6.4% |
| S | 46479 | 5.9% |
| U | 36535 | 4.6% |
| V | 33232 | 4.2% |
| O | 27783 | 3.5% |
| Other values (63) | 200276 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 663364 | |
| Space Separator | 73306 | 9.2% |
| Decimal Number | 51042 | 6.4% |
| Lowercase Letter | 6605 | 0.8% |
| Other Punctuation | 28 | < 0.1% |
| Control | 1 | < 0.1% |
| Other Number | 1 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 1232 | |
| t | 652 | |
| a | 633 | |
| r | 532 | 8.1% |
| n | 480 | 7.3% |
| s | 403 | 6.1% |
| o | 303 | 4.6% |
| u | 298 | 4.5% |
| v | 293 | 4.4% |
| l | 249 | 3.8% |
| Other values (17) | 1530 |
| Value | Count | Frequency (%) |
| E | 137189 | |
| T | 67505 | |
| A | 67113 | |
| R | 54450 | 8.2% |
| N | 50480 | 7.6% |
| S | 46479 | 7.0% |
| U | 36535 | 5.5% |
| V | 33232 | 5.0% |
| O | 27783 | 4.2% |
| L | 21398 | 3.2% |
| Other values (16) | 121200 |
| Value | Count | Frequency (%) |
| 1 | 12085 | |
| 2 | 5965 | |
| 3 | 5406 | |
| 4 | 4635 | 9.1% |
| 5 | 4490 | 8.8% |
| 7 | 3970 | 7.8% |
| 6 | 3878 | 7.6% |
| 8 | 3877 | 7.6% |
| 9 | 3400 | 6.7% |
| 0 | 3336 | 6.5% |
| Value | Count | Frequency (%) |
| / | 11 | |
| . | 7 | |
| & | 6 | |
| , | 2 | 7.1% |
| ¿ | 1 | 3.6% |
| ' | 1 | 3.6% |
| Value | Count | Frequency (%) |
| 73306 |
| Value | Count | Frequency (%) |
| | 1 |
| Value | Count | Frequency (%) |
| ½ | 1 |
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 669969 | |
| Common | 124379 | 15.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 137189 | |
| T | 67505 | |
| A | 67113 | |
| R | 54450 | 8.1% |
| N | 50480 | 7.5% |
| S | 46479 | 6.9% |
| U | 36535 | 5.5% |
| V | 33232 | 5.0% |
| O | 27783 | 4.1% |
| L | 21398 | 3.2% |
| Other values (43) | 127805 |
| Value | Count | Frequency (%) |
| 73306 | ||
| 1 | 12085 | 9.7% |
| 2 | 5965 | 4.8% |
| 3 | 5406 | 4.3% |
| 4 | 4635 | 3.7% |
| 5 | 4490 | 3.6% |
| 7 | 3970 | 3.2% |
| 6 | 3878 | 3.1% |
| 8 | 3877 | 3.1% |
| 9 | 3400 | 2.7% |
| Other values (10) | 3367 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 794345 | |
| None | 3 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| E | 137189 | |
| 73306 | 9.2% | |
| T | 67505 | 8.5% |
| A | 67113 | 8.4% |
| R | 54450 | 6.9% |
| N | 50480 | 6.4% |
| S | 46479 | 5.9% |
| U | 36535 | 4.6% |
| V | 33232 | 4.2% |
| O | 27783 | 3.5% |
| Other values (60) | 200273 |
| Value | Count | Frequency (%) |
| ï | 1 | |
| ¿ | 1 | |
| ½ | 1 |
| Distinct | 28958 |
|---|---|
| Distinct (%) | 85.8% |
| Missing | 94598 |
| Missing (%) | 73.7% |
| Memory size | 6.0 MiB |
| 772 EDGEWATER ROAD | 35 |
|---|---|
| 625 ATLANTIC AVENUE | 22 |
| 815 HUTCHINSON RIVER PARKWAY | 21 |
| 501 GATEWAY DRIVE | 19 |
| 355 FOOD CENTER DRIVE | 19 |
| Other values (28953) |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 40 |
| Min length | 40 |
Characters and Unicode
| Total characters | 1350760 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 25915 ? |
|---|---|
| Unique (%) | 76.7% |
Sample
| 1st row | 760 BROADWAY |
|---|---|
| 2nd row | 948 EAST 179 STREET |
| 3rd row | 793 FLATBUSH AVENUE |
| 4th row | 1539 PARK PLACE |
| 5th row | 1316 UTICA AVENUE |
| Value | Count | Frequency (%) |
| 772 EDGEWATER ROAD | 35 | < 0.1% |
| 625 ATLANTIC AVENUE | 22 | < 0.1% |
| 815 HUTCHINSON RIVER PARKWAY | 21 | < 0.1% |
| 501 GATEWAY DRIVE | 19 | < 0.1% |
| 355 FOOD CENTER DRIVE | 19 | < 0.1% |
| 110-00 ROCKAWAY BOULEVARD | 19 | < 0.1% |
| 450 FLATBUSH AVENUE | 18 | < 0.1% |
| 1400 PELHAM PARKWAY SOUTH | 16 | < 0.1% |
| 63 FLUSHING AVENUE | 16 | < 0.1% |
| 519 GATEWAY DRIVE | 16 | < 0.1% |
| Other values (28948) | 33568 | 26.2% |
| (Missing) | 94598 |
| Value | Count | Frequency (%) |
| avenue | 14010 | 12.8% |
| street | 12639 | 11.6% |
| east | 3709 | 3.4% |
| boulevard | 1929 | 1.8% |
| west | 1877 | 1.7% |
| road | 1609 | 1.5% |
| place | 771 | 0.7% |
| parkway | 754 | 0.7% |
| drive | 471 | 0.4% |
| broadway | 421 | 0.4% |
| Other values (11343) | 71237 |
Most occurring characters
| Value | Count | Frequency (%) |
| 810625 | ||
| E | 78430 | 5.8% |
| T | 41761 | 3.1% |
| A | 39169 | 2.9% |
| R | 31506 | 2.3% |
| N | 28886 | 2.1% |
| 1 | 27887 | 2.1% |
| S | 27882 | 2.1% |
| U | 20076 | 1.5% |
| 2 | 18400 | 1.4% |
| Other values (59) | 226138 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 810625 | |
| Uppercase Letter | 385680 | |
| Decimal Number | 144412 | 10.7% |
| Dash Punctuation | 7696 | 0.6% |
| Lowercase Letter | 2330 | 0.2% |
| Other Punctuation | 15 | < 0.1% |
| Control | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 78430 | |
| T | 41761 | |
| A | 39169 | |
| R | 31506 | |
| N | 28886 | 7.5% |
| S | 27882 | 7.2% |
| U | 20076 | 5.2% |
| V | 18006 | 4.7% |
| O | 16142 | 4.2% |
| L | 12350 | 3.2% |
| Other values (16) | 71472 |
| Value | Count | Frequency (%) |
| e | 394 | |
| t | 244 | |
| r | 225 | |
| a | 212 | 9.1% |
| s | 158 | 6.8% |
| n | 143 | 6.1% |
| o | 124 | 5.3% |
| d | 98 | 4.2% |
| l | 96 | 4.1% |
| v | 92 | 3.9% |
| Other values (16) | 544 |
| Value | Count | Frequency (%) |
| 1 | 27887 | |
| 2 | 18400 | |
| 0 | 15999 | |
| 3 | 14607 | |
| 5 | 14488 | |
| 4 | 12874 | |
| 6 | 10928 | 7.6% |
| 7 | 10249 | 7.1% |
| 8 | 9931 | 6.9% |
| 9 | 9049 | 6.3% |
| Value | Count | Frequency (%) |
| / | 7 | |
| . | 7 | |
| ! | 1 | 6.7% |
| Value | Count | Frequency (%) |
| 810625 |
| Value | Count | Frequency (%) |
| - | 7696 |
| Value | Count | Frequency (%) |
| | 1 |
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 962750 | |
| Latin | 388010 |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 78430 | |
| T | 41761 | |
| A | 39169 | |
| R | 31506 | |
| N | 28886 | 7.4% |
| S | 27882 | 7.2% |
| U | 20076 | 5.2% |
| V | 18006 | 4.6% |
| O | 16142 | 4.2% |
| L | 12350 | 3.2% |
| Other values (42) | 73802 |
| Value | Count | Frequency (%) |
| 810625 | ||
| 1 | 27887 | 2.9% |
| 2 | 18400 | 1.9% |
| 0 | 15999 | 1.7% |
| 3 | 14607 | 1.5% |
| 5 | 14488 | 1.5% |
| 4 | 12874 | 1.3% |
| 6 | 10928 | 1.1% |
| 7 | 10249 | 1.1% |
| 8 | 9931 | 1.0% |
| Other values (7) | 16762 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1350760 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 810625 | ||
| E | 78430 | 5.8% |
| T | 41761 | 3.1% |
| A | 39169 | 2.9% |
| R | 31506 | 2.3% |
| N | 28886 | 2.1% |
| 1 | 27887 | 2.1% |
| S | 27882 | 2.1% |
| U | 20076 | 1.5% |
| 2 | 18400 | 1.4% |
| Other values (59) | 226138 | 16.7% |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3942554882 |
|---|---|
| Minimum | 0 |
| Maximum | 16 |
| Zeros | 90759 |
| Zeros (%) | 70.7% |
| Memory size | 1003.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.749183193 |
|---|---|
| Coefficient of variation (CV) | 1.900247975 |
| Kurtosis | 16.64754482 |
| Mean | 0.3942554882 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.029441689 |
| Sum | 50609 |
| Variance | 0.5612754567 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 90759 | |
| 1 | 29191 | 22.7% |
| 2 | 5576 | 4.3% |
| 3 | 1795 | 1.4% |
| 4 | 653 | 0.5% |
| 5 | 243 | 0.2% |
| 6 | 72 | 0.1% |
| 7 | 39 | < 0.1% |
| 8 | 18 | < 0.1% |
| 9 | 9 | < 0.1% |
| Other values (4) | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 90759 | |
| 1 | 29191 | 22.7% |
| 2 | 5576 | 4.3% |
| 3 | 1795 | 1.4% |
| 4 | 653 | 0.5% |
| 5 | 243 | 0.2% |
| 6 | 72 | 0.1% |
| 7 | 39 | < 0.1% |
| 8 | 18 | < 0.1% |
| 9 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 6 | < 0.1% |
| 9 | 9 | < 0.1% |
| 8 | 18 | < 0.1% |
| 7 | 39 | < 0.1% |
| 6 | 72 | 0.1% |
| 5 | 243 | 0.2% |
| 4 | 653 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.1 MiB |
| 0 | |
|---|---|
| 1 | 269 |
| 2 | 10 |
| 3 | 2 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 128367 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 128085 | |
| 1 | 269 | 0.2% |
| 2 | 10 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 128085 | |
| 1 | 269 | 0.2% |
| 2 | 10 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 128085 | |
| 1 | 269 | 0.2% |
| 2 | 10 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 128367 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 128085 | |
| 1 | 269 | 0.2% |
| 2 | 10 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 128367 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 128085 | |
| 1 | 269 | 0.2% |
| 2 | 10 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128367 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 128085 | |
| 1 | 269 | 0.2% |
| 2 | 10 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05964149665 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 120992 |
| Zeros (%) | 94.3% |
| Memory size | 1003.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2480685043 |
|---|---|
| Coefficient of variation (CV) | 4.159327284 |
| Kurtosis | 29.04223204 |
| Mean | 0.05964149665 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.574181181 |
| Sum | 7656 |
| Variance | 0.06153798281 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 120992 | |
| 1 | 7139 | 5.6% |
| 2 | 205 | 0.2% |
| 3 | 23 | < 0.1% |
| 4 | 5 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 120992 | |
| 1 | 7139 | 5.6% |
| 2 | 205 | 0.2% |
| 3 | 23 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 4 | 5 | < 0.1% |
| 3 | 23 | < 0.1% |
| 2 | 205 | 0.2% |
| 1 | 7139 | 5.6% |
| 0 | 120992 |
NUMBER OF PEDESTRIANS KILLED
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.1 MiB |
| 0 | |
|---|---|
| 1 | 120 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 128367 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 128246 | |
| 1 | 120 | 0.1% |
| 2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 128246 | |
| 1 | 120 | 0.1% |
| 2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 128246 | |
| 1 | 120 | 0.1% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 128367 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 128246 | |
| 1 | 120 | 0.1% |
| 2 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 128367 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 128246 | |
| 1 | 120 | 0.1% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128367 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 128246 | |
| 1 | 120 | 0.1% |
| 2 | 1 | < 0.1% |
NUMBER OF CYCLIST INJURED
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.1 MiB |
| 0 | |
|---|---|
| 1 | 5765 |
| 2 | 125 |
| 3 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 128367 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 122475 | |
| 1 | 5765 | 4.5% |
| 2 | 125 | 0.1% |
| 3 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 122475 | |
| 1 | 5765 | 4.5% |
| 2 | 125 | 0.1% |
| 3 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 122475 | |
| 1 | 5765 | 4.5% |
| 2 | 125 | 0.1% |
| 3 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 128367 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 122475 | |
| 1 | 5765 | 4.5% |
| 2 | 125 | 0.1% |
| 3 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 128367 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 122475 | |
| 1 | 5765 | 4.5% |
| 2 | 125 | 0.1% |
| 3 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128367 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 122475 | |
| 1 | 5765 | 4.5% |
| 2 | 125 | 0.1% |
| 3 | 2 | < 0.1% |
NUMBER OF CYCLIST KILLED
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.1 MiB |
| 0 | |
|---|---|
| 1 | 30 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 128367 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 128337 | |
| 1 | 30 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 128337 | |
| 1 | 30 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 128337 | |
| 1 | 30 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 128367 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 128337 | |
| 1 | 30 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 128367 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 128337 | |
| 1 | 30 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128367 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 128337 | |
| 1 | 30 | < 0.1% |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2877141321 |
|---|---|
| Minimum | 0 |
| Maximum | 16 |
| Zeros | 103780 |
| Zeros (%) | 80.8% |
| Memory size | 1003.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.7168114962 |
|---|---|
| Coefficient of variation (CV) | 2.491401764 |
| Kurtosis | 21.80086343 |
| Mean | 0.2877141321 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.709884974 |
| Sum | 36933 |
| Variance | 0.5138187211 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 103780 | |
| 1 | 16707 | 13.0% |
| 2 | 5111 | 4.0% |
| 3 | 1747 | 1.4% |
| 4 | 644 | 0.5% |
| 5 | 235 | 0.2% |
| 6 | 70 | 0.1% |
| 7 | 37 | < 0.1% |
| 8 | 16 | < 0.1% |
| 9 | 9 | < 0.1% |
| Other values (4) | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 103780 | |
| 1 | 16707 | 13.0% |
| 2 | 5111 | 4.0% |
| 3 | 1747 | 1.4% |
| 4 | 644 | 0.5% |
| 5 | 235 | 0.2% |
| 6 | 70 | 0.1% |
| 7 | 37 | < 0.1% |
| 8 | 16 | < 0.1% |
| 9 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 6 | < 0.1% |
| 9 | 9 | < 0.1% |
| 8 | 16 | < 0.1% |
| 7 | 37 | < 0.1% |
| 6 | 70 | 0.1% |
| 5 | 235 | 0.2% |
| 4 | 644 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.1 MiB |
| 0 | |
|---|---|
| 1 | 121 |
| 2 | 8 |
| 3 | 2 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 128367 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 128235 | |
| 1 | 121 | 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 128235 | |
| 1 | 121 | 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 128235 | |
| 1 | 121 | 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 128367 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 128235 | |
| 1 | 121 | 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 128367 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 128235 | |
| 1 | 121 | 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128367 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 128235 | |
| 1 | 121 | 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| Distinct | 55 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 590 |
| Missing (%) | 0.5% |
| Memory size | 9.5 MiB |
| Unspecified | |
|---|---|
| Driver Inattention/Distraction | |
| Following Too Closely | |
| Failure to Yield Right-of-Way | |
| Passing or Lane Usage Improper | |
| Other values (50) |
Length
| Max length | 53 |
|---|---|
| Median length | 20 |
| Mean length | 21.02476972 |
| Min length | 5 |
Characters and Unicode
| Total characters | 2686482 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Tire Failure/Inadequate |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Driver Inattention/Distraction |
| 4th row | Pedestrian/Bicyclist/Other Pedestrian Error/Confusion |
| 5th row | Driver Inattention/Distraction |
| Value | Count | Frequency (%) |
| Unspecified | 33811 | |
| Driver Inattention/Distraction | 32243 | |
| Following Too Closely | 8514 | 6.6% |
| Failure to Yield Right-of-Way | 8092 | 6.3% |
| Passing or Lane Usage Improper | 4821 | 3.8% |
| Passing Too Closely | 4675 | 3.6% |
| Backing Unsafely | 4577 | 3.6% |
| Other Vehicular | 3773 | 2.9% |
| Unsafe Speed | 3759 | 2.9% |
| Unsafe Lane Changing | 2890 | 2.3% |
| Other values (45) | 20622 |
| Value | Count | Frequency (%) |
| driver | 34488 | 12.4% |
| unspecified | 33811 | 12.2% |
| inattention/distraction | 32243 | 11.6% |
| too | 13189 | 4.8% |
| closely | 13189 | 4.8% |
| to | 10246 | 3.7% |
| passing | 9496 | 3.4% |
| failure | 8518 | 3.1% |
| following | 8514 | 3.1% |
| yield | 8092 | 2.9% |
| Other values (93) | 105421 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 288170 | 10.7% |
| e | 264106 | 9.8% |
| n | 241513 | 9.0% |
| t | 202968 | 7.6% |
| o | 172821 | 6.4% |
| r | 167803 | 6.2% |
| 149430 | 5.6% | |
| a | 142479 | 5.3% |
| s | 129683 | 4.8% |
| c | 91519 | 3.4% |
| Other values (42) | 835990 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2174095 | |
| Uppercase Letter | 308224 | 11.5% |
| Space Separator | 149430 | 5.6% |
| Other Punctuation | 38069 | 1.4% |
| Dash Punctuation | 16304 | 0.6% |
| Open Punctuation | 180 | < 0.1% |
| Close Punctuation | 180 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 288170 | |
| e | 264106 | |
| n | 241513 | |
| t | 202968 | |
| o | 172821 | |
| r | 167803 | |
| a | 142479 | 6.6% |
| s | 129683 | 6.0% |
| c | 91519 | 4.2% |
| l | 89424 | 4.1% |
| Other values (15) | 383609 |
| Value | Count | Frequency (%) |
| D | 72731 | |
| U | 51883 | |
| I | 44057 | |
| C | 20482 | 6.6% |
| T | 18707 | 6.1% |
| F | 17799 | 5.8% |
| P | 13448 | 4.4% |
| R | 12321 | 4.0% |
| L | 9069 | 2.9% |
| W | 8173 | 2.7% |
| Other values (12) | 39554 |
| Value | Count | Frequency (%) |
| 149430 |
| Value | Count | Frequency (%) |
| / | 38069 |
| Value | Count | Frequency (%) |
| - | 16304 |
| Value | Count | Frequency (%) |
| ( | 180 |
| Value | Count | Frequency (%) |
| ) | 180 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2482319 | |
| Common | 204163 | 7.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 288170 | |
| e | 264106 | 10.6% |
| n | 241513 | 9.7% |
| t | 202968 | 8.2% |
| o | 172821 | 7.0% |
| r | 167803 | 6.8% |
| a | 142479 | 5.7% |
| s | 129683 | 5.2% |
| c | 91519 | 3.7% |
| l | 89424 | 3.6% |
| Other values (37) | 691833 |
| Value | Count | Frequency (%) |
| 149430 | ||
| / | 38069 | 18.6% |
| - | 16304 | 8.0% |
| ( | 180 | 0.1% |
| ) | 180 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2686482 |
Most frequent character per block
| Value | Count | Frequency (%) |
| i | 288170 | 10.7% |
| e | 264106 | 9.8% |
| n | 241513 | 9.0% |
| t | 202968 | 7.6% |
| o | 172821 | 6.4% |
| r | 167803 | 6.2% |
| 149430 | 5.6% | |
| a | 142479 | 5.3% |
| s | 129683 | 4.8% |
| c | 91519 | 3.4% |
| Other values (42) | 835990 |
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 28507 |
| Missing (%) | 22.2% |
| Memory size | 7.5 MiB |
| Unspecified | |
|---|---|
| Driver Inattention/Distraction | 5936 |
| Other Vehicular | 1552 |
| Following Too Closely | 1521 |
| Passing or Lane Usage Improper | 853 |
| Other values (43) | 5302 |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 13.13994592 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1312155 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Driver Inattention/Distraction |
| 4th row | Unspecified |
| 5th row | Other Vehicular |
| Value | Count | Frequency (%) |
| Unspecified | 84696 | |
| Driver Inattention/Distraction | 5936 | 4.6% |
| Other Vehicular | 1552 | 1.2% |
| Following Too Closely | 1521 | 1.2% |
| Passing or Lane Usage Improper | 853 | 0.7% |
| Failure to Yield Right-of-Way | 812 | 0.6% |
| Passing Too Closely | 601 | 0.5% |
| Unsafe Speed | 551 | 0.4% |
| Traffic Control Disregarded | 487 | 0.4% |
| Unsafe Lane Changing | 446 | 0.3% |
| Other values (38) | 2405 | 1.9% |
| (Missing) | 28507 | 22.2% |
| Value | Count | Frequency (%) |
| unspecified | 84696 | |
| driver | 6201 | 5.0% |
| inattention/distraction | 5936 | 4.8% |
| closely | 2122 | 1.7% |
| too | 2122 | 1.7% |
| other | 1567 | 1.3% |
| vehicular | 1552 | 1.3% |
| following | 1521 | 1.2% |
| passing | 1454 | 1.2% |
| lane | 1324 | 1.1% |
| Other values (81) | 14795 | 12.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 206639 | |
| e | 201390 | |
| n | 119405 | |
| s | 100187 | |
| c | 94901 | |
| d | 88543 | |
| p | 88330 | |
| f | 88109 | |
| U | 87091 | |
| t | 36316 | 2.8% |
| Other values (42) | 201244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1150613 | |
| Uppercase Letter | 129295 | 9.9% |
| Space Separator | 23430 | 1.8% |
| Other Punctuation | 7131 | 0.5% |
| Dash Punctuation | 1662 | 0.1% |
| Open Punctuation | 12 | < 0.1% |
| Close Punctuation | 12 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 206639 | |
| e | 201390 | |
| n | 119405 | |
| s | 100187 | |
| c | 94901 | |
| d | 88543 | |
| p | 88330 | |
| f | 88109 | |
| t | 36316 | 3.2% |
| r | 30681 | 2.7% |
| Other values (15) | 96112 |
| Value | Count | Frequency (%) |
| U | 87091 | |
| D | 12957 | 10.0% |
| I | 7506 | 5.8% |
| C | 3374 | 2.6% |
| T | 2921 | 2.3% |
| F | 2392 | 1.9% |
| O | 2223 | 1.7% |
| P | 2198 | 1.7% |
| V | 2146 | 1.7% |
| L | 1556 | 1.2% |
| Other values (12) | 4931 | 3.8% |
| Value | Count | Frequency (%) |
| 23430 |
| Value | Count | Frequency (%) |
| / | 7131 |
| Value | Count | Frequency (%) |
| - | 1662 |
| Value | Count | Frequency (%) |
| ( | 12 |
| Value | Count | Frequency (%) |
| ) | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1279908 | |
| Common | 32247 | 2.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 206639 | |
| e | 201390 | |
| n | 119405 | |
| s | 100187 | |
| c | 94901 | |
| d | 88543 | |
| p | 88330 | |
| f | 88109 | |
| U | 87091 | |
| t | 36316 | 2.8% |
| Other values (37) | 168997 |
| Value | Count | Frequency (%) |
| 23430 | ||
| / | 7131 | 22.1% |
| - | 1662 | 5.2% |
| ( | 12 | < 0.1% |
| ) | 12 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1312155 |
Most frequent character per block
| Value | Count | Frequency (%) |
| i | 206639 | |
| e | 201390 | |
| n | 119405 | |
| s | 100187 | |
| c | 94901 | |
| d | 88543 | |
| p | 88330 | |
| f | 88109 | |
| U | 87091 | |
| t | 36316 | 2.8% |
| Other values (42) | 201244 |
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 116070 |
| Missing (%) | 90.4% |
| Memory size | 4.3 MiB |
| Unspecified | |
|---|---|
| Other Vehicular | 247 |
| Following Too Closely | 222 |
| Driver Inattention/Distraction | 165 |
| Pavement Slippery | 26 |
| Other values (25) | 140 |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 11.66414573 |
| Min length | 5 |
Characters and Unicode
| Total characters | 143434 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Driver Inattention/Distraction |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Following Too Closely |
| Value | Count | Frequency (%) |
| Unspecified | 11497 | 9.0% |
| Other Vehicular | 247 | 0.2% |
| Following Too Closely | 222 | 0.2% |
| Driver Inattention/Distraction | 165 | 0.1% |
| Pavement Slippery | 26 | < 0.1% |
| Reaction to Uninvolved Vehicle | 22 | < 0.1% |
| Unsafe Speed | 18 | < 0.1% |
| Unsafe Lane Changing | 11 | < 0.1% |
| Driver Inexperience | 10 | < 0.1% |
| Obstruction/Debris | 9 | < 0.1% |
| Other values (20) | 70 | 0.1% |
| (Missing) | 116070 |
| Value | Count | Frequency (%) |
| unspecified | 11497 | |
| other | 248 | 1.8% |
| vehicular | 247 | 1.8% |
| too | 231 | 1.7% |
| closely | 231 | 1.7% |
| following | 222 | 1.7% |
| driver | 175 | 1.3% |
| inattention/distraction | 165 | 1.2% |
| unsafe | 29 | 0.2% |
| to | 28 | 0.2% |
| Other values (51) | 357 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 24508 | |
| i | 24392 | |
| n | 12656 | |
| s | 12031 | |
| c | 12015 | |
| p | 11606 | |
| d | 11579 | |
| U | 11557 | |
| f | 11554 | |
| o | 1640 | 1.1% |
| Other values (38) | 9896 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 128481 | |
| Uppercase Letter | 13603 | 9.5% |
| Space Separator | 1133 | 0.8% |
| Other Punctuation | 201 | 0.1% |
| Dash Punctuation | 14 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 24508 | |
| i | 24392 | |
| n | 12656 | |
| s | 12031 | |
| c | 12015 | |
| p | 11606 | |
| d | 11579 | |
| f | 11554 | |
| o | 1640 | 1.3% |
| l | 1286 | 1.0% |
| Other values (13) | 5214 | 4.1% |
| Value | Count | Frequency (%) |
| U | 11557 | |
| D | 371 | 2.7% |
| V | 278 | 2.0% |
| O | 268 | 2.0% |
| C | 256 | 1.9% |
| T | 243 | 1.8% |
| F | 229 | 1.7% |
| I | 197 | 1.4% |
| P | 50 | 0.4% |
| S | 45 | 0.3% |
| Other values (10) | 109 | 0.8% |
| Value | Count | Frequency (%) |
| 1133 |
| Value | Count | Frequency (%) |
| / | 201 |
| Value | Count | Frequency (%) |
| - | 14 |
| Value | Count | Frequency (%) |
| ( | 1 |
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 142084 | |
| Common | 1350 | 0.9% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 24508 | |
| i | 24392 | |
| n | 12656 | |
| s | 12031 | |
| c | 12015 | |
| p | 11606 | |
| d | 11579 | |
| U | 11557 | |
| f | 11554 | |
| o | 1640 | 1.2% |
| Other values (33) | 8546 | 6.0% |
| Value | Count | Frequency (%) |
| 1133 | ||
| / | 201 | 14.9% |
| - | 14 | 1.0% |
| ( | 1 | 0.1% |
| ) | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143434 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 24508 | |
| i | 24392 | |
| n | 12656 | |
| s | 12031 | |
| c | 12015 | |
| p | 11606 | |
| d | 11579 | |
| U | 11557 | |
| f | 11554 | |
| o | 1640 | 1.1% |
| Other values (38) | 9896 |
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 125009 |
| Missing (%) | 97.4% |
| Memory size | 4.0 MiB |
| Unspecified | |
|---|---|
| Other Vehicular | 76 |
| Following Too Closely | 49 |
| Driver Inattention/Distraction | 35 |
| Pavement Slippery | 7 |
| Other values (11) | 22 |
Length
| Max length | 30 |
|---|---|
| Median length | 11 |
| Mean length | 11.51488982 |
| Min length | 11 |
Characters and Unicode
| Total characters | 38667 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| Unspecified | 3169 | 2.5% |
| Other Vehicular | 76 | 0.1% |
| Following Too Closely | 49 | < 0.1% |
| Driver Inattention/Distraction | 35 | < 0.1% |
| Pavement Slippery | 7 | < 0.1% |
| Reaction to Uninvolved Vehicle | 4 | < 0.1% |
| Unsafe Speed | 3 | < 0.1% |
| Driver Inexperience | 3 | < 0.1% |
| Outside Car Distraction | 2 | < 0.1% |
| Obstruction/Debris | 2 | < 0.1% |
| Other values (6) | 8 | < 0.1% |
| (Missing) | 125009 |
| Value | Count | Frequency (%) |
| unspecified | 3169 | |
| vehicular | 76 | 2.1% |
| other | 76 | 2.1% |
| closely | 51 | 1.4% |
| too | 51 | 1.4% |
| following | 49 | 1.4% |
| driver | 38 | 1.1% |
| inattention/distraction | 35 | 1.0% |
| slippery | 7 | 0.2% |
| pavement | 7 | 0.2% |
| Other values (25) | 52 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6692 | |
| i | 6649 | |
| n | 3400 | |
| c | 3298 | |
| s | 3279 | |
| p | 3191 | |
| d | 3180 | |
| U | 3178 | |
| f | 3174 | |
| o | 344 | 0.9% |
| Other values (29) | 2282 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34730 | |
| Uppercase Letter | 3645 | 9.4% |
| Space Separator | 253 | 0.7% |
| Other Punctuation | 39 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 6692 | |
| i | 6649 | |
| n | 3400 | |
| c | 3298 | |
| s | 3279 | |
| p | 3191 | |
| d | 3180 | |
| f | 3174 | |
| o | 344 | 1.0% |
| l | 295 | 0.8% |
| Other values (13) | 1228 | 3.5% |
| Value | Count | Frequency (%) |
| U | 3178 | |
| D | 80 | 2.2% |
| O | 80 | 2.2% |
| V | 80 | 2.2% |
| C | 53 | 1.5% |
| T | 51 | 1.4% |
| F | 49 | 1.3% |
| I | 40 | 1.1% |
| P | 10 | 0.3% |
| S | 10 | 0.3% |
| Other values (4) | 14 | 0.4% |
| Value | Count | Frequency (%) |
| 253 |
| Value | Count | Frequency (%) |
| / | 39 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38375 | |
| Common | 292 | 0.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 6692 | |
| i | 6649 | |
| n | 3400 | |
| c | 3298 | |
| s | 3279 | |
| p | 3191 | |
| d | 3180 | |
| U | 3178 | |
| f | 3174 | |
| o | 344 | 0.9% |
| Other values (27) | 1990 | 5.2% |
| Value | Count | Frequency (%) |
| 253 | ||
| / | 39 | 13.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38667 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 6692 | |
| i | 6649 | |
| n | 3400 | |
| c | 3298 | |
| s | 3279 | |
| p | 3191 | |
| d | 3180 | |
| U | 3178 | |
| f | 3174 | |
| o | 344 | 0.9% |
| Other values (29) | 2282 | 5.9% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 127377 |
| Missing (%) | 99.2% |
| Memory size | 4.0 MiB |
| Unspecified | |
|---|---|
| Other Vehicular | 29 |
| Following Too Closely | 17 |
| Driver Inattention/Distraction | 6 |
| Pavement Slippery | 5 |
| Other values (5) | 7 |
Length
| Max length | 30 |
|---|---|
| Median length | 11 |
| Mean length | 11.48989899 |
| Min length | 11 |
Characters and Unicode
| Total characters | 11375 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| Unspecified | 926 | 0.7% |
| Other Vehicular | 29 | < 0.1% |
| Following Too Closely | 17 | < 0.1% |
| Driver Inattention/Distraction | 6 | < 0.1% |
| Pavement Slippery | 5 | < 0.1% |
| Obstruction/Debris | 2 | < 0.1% |
| Outside Car Distraction | 2 | < 0.1% |
| Passing Too Closely | 1 | < 0.1% |
| Driver Inexperience | 1 | < 0.1% |
| Unsafe Speed | 1 | < 0.1% |
| (Missing) | 127377 |
| Value | Count | Frequency (%) |
| unspecified | 926 | |
| vehicular | 29 | 2.7% |
| other | 29 | 2.7% |
| closely | 18 | 1.7% |
| too | 18 | 1.7% |
| following | 17 | 1.6% |
| driver | 7 | 0.7% |
| inattention/distraction | 6 | 0.6% |
| slippery | 5 | 0.5% |
| pavement | 5 | 0.5% |
| Other values (8) | 12 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1967 | |
| i | 1940 | |
| n | 980 | |
| c | 966 | |
| s | 961 | |
| p | 938 | |
| d | 929 | |
| U | 927 | |
| f | 927 | |
| l | 104 | 0.9% |
| Other values (24) | 736 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10205 | |
| Uppercase Letter | 1080 | 9.5% |
| Space Separator | 82 | 0.7% |
| Other Punctuation | 8 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 1967 | |
| i | 1940 | |
| n | 980 | |
| c | 966 | |
| s | 961 | |
| p | 938 | |
| d | 929 | |
| f | 927 | |
| l | 104 | 1.0% |
| o | 104 | 1.0% |
| Other values (12) | 389 | 3.8% |
| Value | Count | Frequency (%) |
| U | 927 | |
| O | 33 | 3.1% |
| V | 29 | 2.7% |
| C | 20 | 1.9% |
| T | 18 | 1.7% |
| F | 17 | 1.6% |
| D | 17 | 1.6% |
| I | 7 | 0.6% |
| P | 6 | 0.6% |
| S | 6 | 0.6% |
| Value | Count | Frequency (%) |
| 82 |
| Value | Count | Frequency (%) |
| / | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11285 | |
| Common | 90 | 0.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 1967 | |
| i | 1940 | |
| n | 980 | |
| c | 966 | |
| s | 961 | |
| p | 938 | |
| d | 929 | |
| U | 927 | |
| f | 927 | |
| l | 104 | 0.9% |
| Other values (22) | 646 | 5.7% |
| Value | Count | Frequency (%) |
| 82 | ||
| / | 8 | 8.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11375 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 1967 | |
| i | 1940 | |
| n | 980 | |
| c | 966 | |
| s | 961 | |
| p | 938 | |
| d | 929 | |
| U | 927 | |
| f | 927 | |
| l | 104 | 0.9% |
| Other values (24) | 736 | 6.5% |
| Distinct | 128367 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4332874.476 |
|---|---|
| Minimum | 4063247 |
| Maximum | 4397407 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1003.0 KiB |
Quantile statistics
| Minimum | 4063247 |
|---|---|
| 5-th percentile | 4274921.3 |
| Q1 | 4300790.5 |
| median | 4332905 |
| Q3 | 4365013.5 |
| 95-th percentile | 4390704.7 |
| Maximum | 4397407 |
| Range | 334160 |
| Interquartile range (IQR) | 64223 |
Descriptive statistics
| Standard deviation | 37134.97788 |
|---|---|
| Coefficient of variation (CV) | 0.008570517814 |
| Kurtosis | -1.159355039 |
| Mean | 4332874.476 |
| Median Absolute Deviation (MAD) | 32112 |
| Skewness | -0.008462197276 |
| Sum | 5.561980978 × 1011 |
| Variance | 1379006582 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 4270043 | 1 | < 0.1% |
| 4322708 | 1 | < 0.1% |
| 4279683 | 1 | < 0.1% |
| 4289924 | 1 | < 0.1% |
| 4291973 | 1 | < 0.1% |
| 4285830 | 1 | < 0.1% |
| 4287879 | 1 | < 0.1% |
| 4273548 | 1 | < 0.1% |
| 4275597 | 1 | < 0.1% |
| 4269454 | 1 | < 0.1% |
| Other values (128357) | 128357 |
| Value | Count | Frequency (%) |
| 4063247 | 1 | |
| 4073803 | 1 | |
| 4267700 | 1 | |
| 4267732 | 1 | |
| 4267823 | 1 | |
| 4267839 | 1 | |
| 4267851 | 1 | |
| 4267864 | 1 | |
| 4267865 | 1 | |
| 4267868 | 1 |
| Value | Count | Frequency (%) |
| 4397407 | 1 | |
| 4397405 | 1 | |
| 4397404 | 1 | |
| 4397403 | 1 | |
| 4397401 | 1 | |
| 4397396 | 1 | |
| 4397395 | 1 | |
| 4397394 | 1 | |
| 4397390 | 1 | |
| 4397386 | 1 |
| Distinct | 414 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1259 |
| Missing (%) | 1.0% |
| Memory size | 8.9 MiB |
| Sedan | |
|---|---|
| Station Wagon/Sport Utility Vehicle | |
| Taxi | 4122 |
| Pick-up Truck | 2996 |
| Box Truck | 2353 |
| Other values (409) |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 16.43344243 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2088822 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 261 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Taxi |
| 3rd row | Station Wagon/Sport Utility Vehicle |
| 4th row | Sedan |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
| Sedan | 60122 | |
| Station Wagon/Sport Utility Vehicle | 46343 | |
| Taxi | 4122 | 3.2% |
| Pick-up Truck | 2996 | 2.3% |
| Box Truck | 2353 | 1.8% |
| Bus | 1725 | 1.3% |
| Bike | 1570 | 1.2% |
| Tractor Truck Diesel | 1038 | 0.8% |
| Motorcycle | 828 | 0.6% |
| Van | 766 | 0.6% |
| Other values (404) | 5245 | 4.1% |
| (Missing) | 1259 | 1.0% |
| Value | Count | Frequency (%) |
| sedan | 60336 | |
| vehicle | 46358 | |
| utility | 46355 | |
| station | 46343 | |
| wagon/sport | 46343 | |
| truck | 6833 | 2.5% |
| taxi | 4122 | 1.5% |
| pick-up | 2997 | 1.1% |
| box | 2369 | 0.9% |
| bus | 1760 | 0.6% |
| Other values (317) | 12413 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 235245 | |
| i | 196355 | 9.4% |
| e | 161903 | 7.8% |
| a | 161353 | 7.7% |
| n | 155216 | 7.4% |
| S | 153436 | 7.3% |
| 149123 | 7.1% | |
| o | 146949 | 7.0% |
| l | 96430 | 4.6% |
| d | 61058 | 2.9% |
| Other values (55) | 571754 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1563205 | |
| Uppercase Letter | 325896 | 15.6% |
| Space Separator | 149123 | 7.1% |
| Other Punctuation | 46477 | 2.2% |
| Dash Punctuation | 3844 | 0.2% |
| Decimal Number | 277 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 153436 | |
| V | 47198 | 14.5% |
| U | 46705 | 14.3% |
| W | 46518 | 14.3% |
| T | 12518 | 3.8% |
| B | 6525 | 2.0% |
| P | 3410 | 1.0% |
| D | 1553 | 0.5% |
| M | 1479 | 0.5% |
| A | 1234 | 0.4% |
| Other values (16) | 5320 | 1.6% |
| Value | Count | Frequency (%) |
| t | 235245 | |
| i | 196355 | |
| e | 161903 | |
| a | 161353 | |
| n | 155216 | |
| o | 146949 | |
| l | 96430 | |
| d | 61058 | 3.9% |
| c | 60415 | 3.9% |
| r | 59030 | 3.8% |
| Other values (14) | 229251 |
| Value | Count | Frequency (%) |
| 4 | 199 | |
| 3 | 32 | 11.6% |
| 2 | 19 | 6.9% |
| 1 | 8 | 2.9% |
| 5 | 6 | 2.2% |
| 0 | 5 | 1.8% |
| 7 | 3 | 1.1% |
| 8 | 3 | 1.1% |
| 6 | 2 | 0.7% |
| Value | Count | Frequency (%) |
| / | 46472 | |
| # | 2 | < 0.1% |
| . | 2 | < 0.1% |
| , | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 149123 |
| Value | Count | Frequency (%) |
| - | 3844 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1889101 | |
| Common | 199721 | 9.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 235245 | |
| i | 196355 | |
| e | 161903 | 8.6% |
| a | 161353 | 8.5% |
| n | 155216 | 8.2% |
| S | 153436 | 8.1% |
| o | 146949 | 7.8% |
| l | 96430 | 5.1% |
| d | 61058 | 3.2% |
| c | 60415 | 3.2% |
| Other values (40) | 460741 |
| Value | Count | Frequency (%) |
| 149123 | ||
| / | 46472 | 23.3% |
| - | 3844 | 1.9% |
| 4 | 199 | 0.1% |
| 3 | 32 | < 0.1% |
| 2 | 19 | < 0.1% |
| 1 | 8 | < 0.1% |
| 5 | 6 | < 0.1% |
| 0 | 5 | < 0.1% |
| 7 | 3 | < 0.1% |
| Other values (5) | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2088822 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 235245 | |
| i | 196355 | 9.4% |
| e | 161903 | 7.8% |
| a | 161353 | 7.7% |
| n | 155216 | 7.4% |
| S | 153436 | 7.3% |
| 149123 | 7.1% | |
| o | 146949 | 7.0% |
| l | 96430 | 4.6% |
| d | 61058 | 2.9% |
| Other values (55) | 571754 |
| Distinct | 427 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 39742 |
| Missing (%) | 31.0% |
| Memory size | 7.4 MiB |
| Sedan | |
|---|---|
| Station Wagon/Sport Utility Vehicle | |
| Bike | |
| Box Truck | 2575 |
| Pick-up Truck | 2413 |
| Other values (422) |
Length
| Max length | 38 |
|---|---|
| Median length | 5 |
| Mean length | 15.6534725 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1387289 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 275 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Pick-up Truck |
|---|---|
| 2nd row | Taxi |
| 3rd row | Sedan |
| 4th row | Station Wagon/Sport Utility Vehicle |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| Sedan | 38551 | |
| Station Wagon/Sport Utility Vehicle | 29555 | |
| Bike | 4082 | 3.2% |
| Box Truck | 2575 | 2.0% |
| Pick-up Truck | 2413 | 1.9% |
| Taxi | 2275 | 1.8% |
| Bus | 1466 | 1.1% |
| Tractor Truck Diesel | 994 | 0.8% |
| E-Scooter | 723 | 0.6% |
| Motorcycle | 717 | 0.6% |
| Other values (417) | 5274 | 4.1% |
| (Missing) | 39742 |
| Value | Count | Frequency (%) |
| sedan | 38680 | |
| vehicle | 29569 | |
| utility | 29560 | |
| wagon/sport | 29555 | |
| station | 29555 | |
| truck | 6419 | 3.4% |
| bike | 4088 | 2.2% |
| box | 2592 | 1.4% |
| pick-up | 2414 | 1.3% |
| taxi | 2275 | 1.2% |
| Other values (323) | 12042 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 151455 | 10.9% |
| i | 129566 | 9.3% |
| e | 109191 | 7.9% |
| a | 103687 | 7.5% |
| n | 99471 | 7.2% |
| S | 98626 | 7.1% |
| 98126 | 7.1% | |
| o | 97153 | 7.0% |
| l | 62267 | 4.5% |
| c | 42325 | 3.1% |
| Other values (53) | 395422 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1034963 | |
| Uppercase Letter | 220429 | 15.9% |
| Space Separator | 98126 | 7.1% |
| Other Punctuation | 29676 | 2.1% |
| Dash Punctuation | 3919 | 0.3% |
| Decimal Number | 175 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 98626 | |
| V | 30279 | 13.7% |
| U | 29876 | 13.6% |
| W | 29745 | 13.5% |
| T | 10256 | 4.7% |
| B | 9323 | 4.2% |
| P | 2783 | 1.3% |
| E | 1825 | 0.8% |
| D | 1521 | 0.7% |
| M | 1355 | 0.6% |
| Other values (16) | 4840 | 2.2% |
| Value | Count | Frequency (%) |
| t | 151455 | |
| i | 129566 | |
| e | 109191 | |
| a | 103687 | |
| n | 99471 | |
| o | 97153 | |
| l | 62267 | 6.0% |
| c | 42325 | 4.1% |
| r | 41728 | 4.0% |
| d | 39386 | 3.8% |
| Other values (14) | 158734 |
| Value | Count | Frequency (%) |
| 4 | 124 | |
| 3 | 26 | 14.9% |
| 2 | 10 | 5.7% |
| 0 | 5 | 2.9% |
| 1 | 5 | 2.9% |
| 6 | 3 | 1.7% |
| 8 | 1 | 0.6% |
| 5 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| / | 29675 | |
| . | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| - | 3919 |
| Value | Count | Frequency (%) |
| 98126 |
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1255392 | |
| Common | 131897 | 9.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 151455 | |
| i | 129566 | |
| e | 109191 | 8.7% |
| a | 103687 | 8.3% |
| n | 99471 | 7.9% |
| S | 98626 | 7.9% |
| o | 97153 | 7.7% |
| l | 62267 | 5.0% |
| c | 42325 | 3.4% |
| r | 41728 | 3.3% |
| Other values (40) | 319923 |
| Value | Count | Frequency (%) |
| 98126 | ||
| / | 29675 | 22.5% |
| - | 3919 | 3.0% |
| 4 | 124 | 0.1% |
| 3 | 26 | < 0.1% |
| 2 | 10 | < 0.1% |
| 0 | 5 | < 0.1% |
| 1 | 5 | < 0.1% |
| 6 | 3 | < 0.1% |
| 8 | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1387289 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 151455 | 10.9% |
| i | 129566 | 9.3% |
| e | 109191 | 7.9% |
| a | 103687 | 7.5% |
| n | 99471 | 7.2% |
| S | 98626 | 7.1% |
| 98126 | 7.1% | |
| o | 97153 | 7.0% |
| l | 62267 | 4.5% |
| c | 42325 | 3.1% |
| Other values (53) | 395422 |
| Distinct | 77 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 116772 |
| Missing (%) | 91.0% |
| Memory size | 4.4 MiB |
| Sedan | |
|---|---|
| Station Wagon/Sport Utility Vehicle | |
| Pick-up Truck | 264 |
| Taxi | 196 |
| Box Truck | 118 |
| Other values (72) | 501 |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 17.74877102 |
| Min length | 2 |
Characters and Unicode
| Total characters | 205797 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 38 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Pick-up Truck |
|---|---|
| 2nd row | Sedan |
| 3rd row | Station Wagon/Sport Utility Vehicle |
| 4th row | Sedan |
| 5th row | Pick-up Truck |
| Value | Count | Frequency (%) |
| Sedan | 5728 | 4.5% |
| Station Wagon/Sport Utility Vehicle | 4788 | 3.7% |
| Pick-up Truck | 264 | 0.2% |
| Taxi | 196 | 0.2% |
| Box Truck | 118 | 0.1% |
| Bus | 67 | 0.1% |
| Tractor Truck Diesel | 60 | < 0.1% |
| Van | 54 | < 0.1% |
| Bike | 51 | < 0.1% |
| Motorcycle | 36 | < 0.1% |
| Other values (67) | 233 | 0.2% |
| (Missing) | 116772 |
| Value | Count | Frequency (%) |
| sedan | 5741 | |
| vehicle | 4791 | |
| station | 4788 | |
| utility | 4788 | |
| wagon/sport | 4788 | |
| truck | 464 | 1.7% |
| pick-up | 264 | 1.0% |
| taxi | 196 | 0.7% |
| box | 121 | 0.5% |
| bus | 69 | 0.3% |
| Other values (79) | 596 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 24135 | |
| i | 19797 | 9.6% |
| e | 15740 | 7.6% |
| a | 15719 | 7.6% |
| n | 15438 | 7.5% |
| S | 15326 | 7.4% |
| 15011 | 7.3% | |
| o | 14744 | 7.2% |
| l | 9766 | 4.7% |
| d | 5782 | 2.8% |
| Other values (43) | 54339 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 154137 | |
| Uppercase Letter | 31549 | 15.3% |
| Space Separator | 15011 | 7.3% |
| Other Punctuation | 4796 | 2.3% |
| Dash Punctuation | 288 | 0.1% |
| Decimal Number | 16 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| t | 24135 | |
| i | 19797 | |
| e | 15740 | |
| a | 15719 | |
| n | 15438 | |
| o | 14744 | |
| l | 9766 | |
| d | 5782 | 3.8% |
| c | 5692 | 3.7% |
| r | 5560 | 3.6% |
| Other values (14) | 21764 |
| Value | Count | Frequency (%) |
| S | 15326 | |
| V | 4852 | 15.4% |
| U | 4811 | 15.2% |
| W | 4804 | 15.2% |
| T | 750 | 2.4% |
| P | 287 | 0.9% |
| B | 270 | 0.9% |
| D | 76 | 0.2% |
| C | 66 | 0.2% |
| M | 65 | 0.2% |
| Other values (13) | 242 | 0.8% |
| Value | Count | Frequency (%) |
| 4 | 13 | |
| 3 | 2 | 12.5% |
| 2 | 1 | 6.2% |
| Value | Count | Frequency (%) |
| - | 288 |
| Value | Count | Frequency (%) |
| 15011 |
| Value | Count | Frequency (%) |
| / | 4796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 185686 | |
| Common | 20111 | 9.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 24135 | |
| i | 19797 | |
| e | 15740 | 8.5% |
| a | 15719 | 8.5% |
| n | 15438 | 8.3% |
| S | 15326 | 8.3% |
| o | 14744 | 7.9% |
| l | 9766 | 5.3% |
| d | 5782 | 3.1% |
| c | 5692 | 3.1% |
| Other values (37) | 43547 |
| Value | Count | Frequency (%) |
| 15011 | ||
| / | 4796 | 23.8% |
| - | 288 | 1.4% |
| 4 | 13 | 0.1% |
| 3 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 205797 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 24135 | |
| i | 19797 | 9.6% |
| e | 15740 | 7.6% |
| a | 15719 | 7.6% |
| n | 15438 | 7.5% |
| S | 15326 | 7.4% |
| 15011 | 7.3% | |
| o | 14744 | 7.2% |
| l | 9766 | 4.7% |
| d | 5782 | 2.8% |
| Other values (43) | 54339 |
| Distinct | 33 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 125156 |
| Missing (%) | 97.5% |
| Memory size | 4.0 MiB |
| Sedan | |
|---|---|
| Station Wagon/Sport Utility Vehicle | |
| Pick-up Truck | 73 |
| Taxi | 47 |
| Box Truck | 16 |
| Other values (28) | 96 |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 17.58704453 |
| Min length | 2 |
Characters and Unicode
| Total characters | 56472 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Sedan |
| 3rd row | Sedan |
| 4th row | Pick-up Truck |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| Sedan | 1660 | 1.3% |
| Station Wagon/Sport Utility Vehicle | 1319 | 1.0% |
| Pick-up Truck | 73 | 0.1% |
| Taxi | 47 | < 0.1% |
| Box Truck | 16 | < 0.1% |
| Bus | 15 | < 0.1% |
| Motorcycle | 9 | < 0.1% |
| Van | 9 | < 0.1% |
| Convertible | 9 | < 0.1% |
| Bike | 8 | < 0.1% |
| Other values (23) | 46 | < 0.1% |
| (Missing) | 125156 |
| Value | Count | Frequency (%) |
| sedan | 1662 | |
| utility | 1319 | |
| vehicle | 1319 | |
| station | 1319 | |
| wagon/sport | 1319 | |
| truck | 97 | 1.3% |
| pick-up | 74 | 1.0% |
| taxi | 47 | 0.6% |
| box | 17 | 0.2% |
| bus | 15 | 0.2% |
| Other values (32) | 101 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 6629 | |
| i | 5427 | 9.6% |
| a | 4380 | 7.8% |
| e | 4367 | 7.7% |
| n | 4324 | 7.7% |
| S | 4301 | 7.6% |
| 4078 | 7.2% | |
| o | 4022 | 7.1% |
| l | 2670 | 4.7% |
| d | 1667 | 3.0% |
| Other values (38) | 14607 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 42351 | |
| Uppercase Letter | 8644 | 15.3% |
| Space Separator | 4078 | 7.2% |
| Other Punctuation | 1319 | 2.3% |
| Dash Punctuation | 78 | 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 4301 | |
| V | 1331 | 15.4% |
| U | 1322 | 15.3% |
| W | 1319 | 15.3% |
| T | 156 | 1.8% |
| P | 79 | 0.9% |
| B | 43 | 0.5% |
| C | 17 | 0.2% |
| D | 16 | 0.2% |
| M | 12 | 0.1% |
| Other values (12) | 48 | 0.6% |
| Value | Count | Frequency (%) |
| t | 6629 | |
| i | 5427 | |
| a | 4380 | |
| e | 4367 | |
| n | 4324 | |
| o | 4022 | |
| l | 2670 | |
| d | 1667 | 3.9% |
| c | 1520 | 3.6% |
| r | 1469 | 3.5% |
| Other values (12) | 5876 |
| Value | Count | Frequency (%) |
| 4078 |
| Value | Count | Frequency (%) |
| / | 1319 |
| Value | Count | Frequency (%) |
| - | 78 |
| Value | Count | Frequency (%) |
| 4 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50995 | |
| Common | 5477 | 9.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 6629 | |
| i | 5427 | |
| a | 4380 | 8.6% |
| e | 4367 | 8.6% |
| n | 4324 | 8.5% |
| S | 4301 | 8.4% |
| o | 4022 | 7.9% |
| l | 2670 | 5.2% |
| d | 1667 | 3.3% |
| c | 1520 | 3.0% |
| Other values (34) | 11688 |
| Value | Count | Frequency (%) |
| 4078 | ||
| / | 1319 | 24.1% |
| - | 78 | 1.4% |
| 4 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 6629 | |
| i | 5427 | 9.6% |
| a | 4380 | 7.8% |
| e | 4367 | 7.7% |
| n | 4324 | 7.7% |
| S | 4301 | 7.6% |
| 4078 | 7.2% | |
| o | 4022 | 7.1% |
| l | 2670 | 4.7% |
| d | 1667 | 3.0% |
| Other values (38) | 14607 |
| Distinct | 19 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 127407 |
| Missing (%) | 99.3% |
| Memory size | 4.0 MiB |
| Sedan | |
|---|---|
| Station Wagon/Sport Utility Vehicle | |
| Pick-up Truck | 22 |
| Taxi | 14 |
| Van | 9 |
| Other values (14) | 32 |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 17.734375 |
| Min length | 2 |
Characters and Unicode
| Total characters | 17025 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Sedan |
| 3rd row | Sedan |
| 4th row | Station Wagon/Sport Utility Vehicle |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
| Sedan | 484 | 0.4% |
| Station Wagon/Sport Utility Vehicle | 399 | 0.3% |
| Pick-up Truck | 22 | < 0.1% |
| Taxi | 14 | < 0.1% |
| Van | 9 | < 0.1% |
| PK | 5 | < 0.1% |
| Box Truck | 5 | < 0.1% |
| Motorcycle | 4 | < 0.1% |
| Bus | 3 | < 0.1% |
| Tractor Truck Diesel | 3 | < 0.1% |
| Other values (9) | 12 | < 0.1% |
| (Missing) | 127407 |
| Value | Count | Frequency (%) |
| sedan | 484 | |
| station | 399 | |
| vehicle | 399 | |
| utility | 399 | |
| wagon/sport | 399 | |
| truck | 32 | 1.5% |
| pick-up | 22 | 1.0% |
| taxi | 14 | 0.6% |
| van | 10 | 0.5% |
| box | 7 | 0.3% |
| Other values (12) | 28 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2012 | |
| i | 1640 | 9.6% |
| a | 1309 | 7.7% |
| e | 1303 | 7.7% |
| n | 1294 | 7.6% |
| S | 1283 | 7.5% |
| 1233 | 7.2% | |
| o | 1227 | 7.2% |
| l | 809 | 4.8% |
| d | 484 | 2.8% |
| Other values (32) | 4431 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12767 | |
| Uppercase Letter | 2603 | 15.3% |
| Space Separator | 1233 | 7.2% |
| Other Punctuation | 399 | 2.3% |
| Dash Punctuation | 23 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| t | 2012 | |
| i | 1640 | |
| a | 1309 | |
| e | 1303 | |
| n | 1294 | |
| o | 1227 | |
| l | 809 | |
| d | 484 | 3.8% |
| c | 467 | 3.7% |
| r | 451 | 3.5% |
| Other values (11) | 1771 |
| Value | Count | Frequency (%) |
| S | 1283 | |
| V | 409 | 15.7% |
| W | 399 | 15.3% |
| U | 399 | 15.3% |
| T | 48 | 1.8% |
| P | 27 | 1.0% |
| B | 11 | 0.4% |
| M | 6 | 0.2% |
| K | 5 | 0.2% |
| D | 5 | 0.2% |
| Other values (8) | 11 | 0.4% |
| Value | Count | Frequency (%) |
| 1233 |
| Value | Count | Frequency (%) |
| / | 399 |
| Value | Count | Frequency (%) |
| - | 23 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15370 | |
| Common | 1655 | 9.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 2012 | |
| i | 1640 | |
| a | 1309 | 8.5% |
| e | 1303 | 8.5% |
| n | 1294 | 8.4% |
| S | 1283 | 8.3% |
| o | 1227 | 8.0% |
| l | 809 | 5.3% |
| d | 484 | 3.1% |
| c | 467 | 3.0% |
| Other values (29) | 3542 |
| Value | Count | Frequency (%) |
| 1233 | ||
| / | 399 | 24.1% |
| - | 23 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17025 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 2012 | |
| i | 1640 | 9.6% |
| a | 1309 | 7.7% |
| e | 1303 | 7.7% |
| n | 1294 | 7.6% |
| S | 1283 | 7.5% |
| 1233 | 7.2% | |
| o | 1227 | 7.2% |
| l | 809 | 4.8% |
| d | 484 | 2.8% |
| Other values (32) | 4431 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 01/02/2020 | 0:00 | NaN | NaN | NaN | NaN | NaN | CROSS ISLAND PARKWAY | NaN | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Tire Failure/Inadequate | NaN | NaN | NaN | NaN | 4267700 | Sedan | NaN | NaN | NaN | NaN |
| 1 | 01/02/2020 | 12:57 | NaN | NaN | NaN | NaN | NaN | W 57 & 8th Ave | W 57 | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4268255 | Taxi | Pick-up Truck | NaN | NaN | NaN |
| 2 | 01/02/2020 | 15:00 | NaN | NaN | 40.668266 | -73.842140 | (40.668266, -73.84214) | CROSS BAY BOULEVARD | SOUTH CONDUIT AVENUE | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4268222 | Station Wagon/Sport Utility Vehicle | Taxi | NaN | NaN | NaN |
| 3 | 01/02/2020 | 15:10 | BROOKLYN | 11206.0 | 40.700527 | -73.941610 | (40.700527, -73.94161) | NaN | NaN | 760 BROADWAY | 1.0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | Pedestrian/Bicyclist/Other Pedestrian Error/Confusion | NaN | NaN | NaN | NaN | 4268246 | Sedan | NaN | NaN | NaN | NaN |
| 4 | 01/02/2020 | 17:30 | NaN | NaN | NaN | NaN | NaN | NORTHERN BOULEVARD | 68 STREET | NaN | 1.0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | Driver Inattention/Distraction | Driver Inattention/Distraction | NaN | NaN | NaN | 4268708 | Station Wagon/Sport Utility Vehicle | Sedan | NaN | NaN | NaN |
| 5 | 01/02/2020 | 20:45 | BRONX | 10460.0 | 40.843033 | -73.881805 | (40.843033, -73.881805) | NaN | NaN | 948 EAST 179 STREET | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4268164 | Station Wagon/Sport Utility Vehicle | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 6 | 01/02/2020 | 10:10 | MANHATTAN | 10022.0 | 40.759740 | -73.974230 | (40.75974, -73.97423) | EAST 53 STREET | MADISON AVENUE | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Other Vehicular | Other Vehicular | NaN | NaN | NaN | 4268253 | Sedan | Sedan | NaN | NaN | NaN |
| 7 | 01/02/2020 | 17:18 | NaN | NaN | 40.749550 | -74.006540 | (40.74955, -74.00654) | 11 AVENUE | NaN | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing or Lane Usage Improper | Unspecified | NaN | NaN | NaN | 4268097 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 8 | 01/02/2020 | 18:50 | NaN | NaN | 40.811638 | -73.931600 | (40.811638, -73.9316) | MAJOR DEEGAN EXPRESSWAY | NaN | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unsafe Speed | Reaction to Uninvolved Vehicle | NaN | NaN | NaN | 4268521 | Sedan | Sedan | NaN | NaN | NaN |
| 9 | 01/02/2020 | 13:00 | BROOKLYN | 11226.0 | 40.653328 | -73.959404 | (40.653328, -73.959404) | NaN | NaN | 793 FLATBUSH AVENUE | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4268069 | Sedan | NaN | NaN | NaN | NaN |
Last rows
| CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 128357 | 03/06/2021 | 23:50 | MANHATTAN | 10013.0 | 40.721350 | -74.004650 | (40.72135, -74.00465) | CANAL STREET | WEST BROADWAY | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4396733 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 128358 | 03/06/2021 | 9:15 | BROOKLYN | 11218.0 | 40.649940 | -73.974010 | (40.64994, -73.97401) | NaN | NaN | 31 OCEAN PARKWAY | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Other Vehicular | Unspecified | NaN | NaN | NaN | 4396709 | Sedan | NaN | NaN | NaN | NaN |
| 128359 | 03/06/2021 | 15:21 | MANHATTAN | 10024.0 | 40.783974 | -73.970310 | (40.783974, -73.97031) | CENTRAL PARK WEST | WEST 84 STREET | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Driver Inattention/Distraction | NaN | NaN | NaN | 4396899 | Van | Sedan | NaN | NaN | NaN |
| 128360 | 03/06/2021 | 14:40 | NaN | NaN | NaN | NaN | NaN | BRUCKNER EXPRESSWAY RAMP | NaN | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unsafe Lane Changing | Unspecified | NaN | NaN | NaN | 4397161 | Taxi | Sedan | NaN | NaN | NaN |
| 128361 | 03/06/2021 | 6:14 | BRONX | 10452.0 | 40.841960 | -73.915306 | (40.84196, -73.915306) | EAST 172 STREET | TOWNSEND AVENUE | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4396610 | Sedan | NaN | NaN | NaN | NaN |
| 128362 | 03/06/2021 | 17:00 | MANHATTAN | 10031.0 | 40.822834 | -73.953710 | (40.822834, -73.95371) | NaN | NaN | 610 WEST 139 STREET | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4396982 | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN | NaN |
| 128363 | 03/06/2021 | 19:30 | BRONX | 10466.0 | 40.887096 | -73.860870 | (40.887096, -73.86087) | WHITE PLAINS ROAD | EAST 224 STREET | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Driver Inattention/Distraction | NaN | NaN | NaN | 4396676 | Sedan | NaN | NaN | NaN | NaN |
| 128364 | 03/06/2021 | 1:30 | BRONX | 10466.0 | 40.883410 | -73.837800 | (40.88341, -73.8378) | NaN | NaN | 3601 PALMER AVENUE | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4396673 | Dump | NaN | NaN | NaN | NaN |
| 128365 | 03/06/2021 | 16:30 | QUEENS | 11101.0 | 40.737537 | -73.929955 | (40.737537, -73.929955) | HUNTERS POINT AVENUE | GREENPOINT AVENUE | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Passing or Lane Usage Improper | NaN | NaN | NaN | 4396622 | Station Wagon/Sport Utility Vehicle | Sedan | NaN | NaN | NaN |
| 128366 | 03/06/2021 | 18:15 | BROOKLYN | 11212.0 | 40.654137 | -73.912340 | (40.654137, -73.91234) | LINDEN BOULEVARD | ROCKAWAY PARKWAY | NaN | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4397200 | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN | NaN |